Straight Line Fitting { a Bayesian Solution
نویسندگان
چکیده
Fitting the \best" straight line to a scatter plot of data D f(x1; y1) : : :(xn; yn)g in which both variables xi; yi are subject to unknown error is undoubtedly the most common problem of inference faced by scientists, engineers, medical researchers, and economists. The problem is to estimate the parameters ; in the straight line equation y = + x , and assess the accuracy of the estimates. Whenever we try to discover or estimate a relationship between two factors we are almost sure to be in this situation. But from the viewpoint of orthodox statistics the problem turned out to be a horrendous can of worms; generations of e orts led only to a long line of false starts, and no satisfactory solution. We give the Bayesian solution to the problem, which turns out to be eminently satisfactory and straightforward, although a little tricky in the derivation. However, not much of the nal result is really new. Arnold Zellner (1971) gave a very similar solution long ago, but it went unnoticed by those who had the most need to know about it. We give a pedagogical introduction to the problem and add a few nal touches, dealing with choice of priors and parameterizations. In any event, whether or not the following solution has anything new in it, the currently great and universal importance of the problem would warrant bringing the result to the attention of the scienti c community. Many workers, from astronomers to biologists, are still struggling with the problem, unaware that the solution is known.
منابع مشابه
On the Cost of Approximating and Recogni?ing a Noise Perturbed Straicht Line or a Ouadfatic Curve Segment in the Plane
Approximation of noisy data in the plane by straight lines or elliptic: or single-branch hyperbolic curve segments arises in pattern recogniticn, data compaction, and other problems. A number of questions concerning the efficient search fo_ and approximation of data by such curves are examined. Fecursive least-squares linear curve-fitting is used, and ellipses and hyperbolas are parameterized a...
متن کاملCho 1 Statistic Literature Review – OLS Regression
1. A brief summary of the statistical method (1) What is the method? Regression is a statistical technique used for modeling and analysis of numerical data by finding the best-fitting straight line. In other words, a regression line is defined the combination between the average values of a numerical outcome variable and deterministic variables. The ways to find a best fitting straight line, es...
متن کاملSkew detection and correction in document images bsed on straight-line fitting
During document scanning, skew is inevitably introduced into the incoming document image. Since the algorithms for layout analysis and character recognition are generally very sensitive to the page skew, skew detection and correction in document images are the critical steps before layout analysis. In this paper, a novel skew detection method based on straight-line fitting is proposed. And a co...
متن کاملA Least - Squares Straight - Line Fitting Algorithm with Automatic Error Determination
We demonstrate a new generalized least-squares fitting method which can be used to estimate the slope of the best-fitting straight line that results when two separate data sets which are expected to be linearly correlated are subject to different uncertainties in their measurements. The algorithm determines not only the optimum slope, but also produces estimates of the intrinsic errors associat...
متن کامل